Time Dependent Bounded Recall Strategies Are Enough to Play the Discounted Repeated Prisoners’ Dilemma∗
نویسندگان
چکیده
We show that for any discount factor, there is a natural number M such that all subgame perfect outcomes of the discounted repeated prisoners’ dilemma (pure strategies) can be obtained by subgame perfect strategies with the following property: current play depends only on the number of the time-index and on the history of the last M periods. This strong result, however, does not suffice to establish the claim of the title of this study, because we also establish that for any natural number K, there exist a discount factor and an equilibrium outcome which can only be obtained by equilibrium strategies with recall strictly higher than K. On the other hand, for cases with high discount factors, we prove that payoffs that are bounded away from the common minmax return can be approximated in equilibrium with a bounded recall strategy whose order of recall is independent of the discount factor and the fineness of the approximation. Therefore, noticing that payoffs not bounded away from the minmax return can each be approximated by the minmax payoff (sustainable in equilibrium with 0-recall) enables us to conclude that bounded recall strategies are enough to play the prisoners’ dilemma. Journal of Economic Literature Classification Numbers: C72; C73; C79
منابع مشابه
A belief-based approach to the repeated prisoners' dilemma with asymmetric private monitoring
The belief-based approach for repeated games with private monitoring studies an important class of strategies where at each point of the game, each players optimal continuation strategy is determined by the players beliefs of the private state of the opponents. This paper extends the belief-based approach to the repeated prisonersdilemma with asymmetric private monitoring. We rst nd that t...
متن کاملImitation Dynamics in the Repeated Prisoners' Dilemma: An Exploratory Example*
This paper investigates a deterministic evolutionary process governing the adoption of strategies for playing the repeated Prisoners' Dilemma. Agents playing unsuccessful strategies attempt to imitate the strategies of successful agents. Because agents' strategies are unobservable, they must be inferred from a memory of pairwise play and a knowledge of the strategy space. As a result, winning s...
متن کاملThe Extent of Cooperation∗
Compte and Postlewaite (2008, 2012) set out a model that assumed that peoples’ behavior was organized via mental systems, and employed that framework to analyze how people might cooperate in a repeated relationship with private monitoring with plausible strategies. That paper focussed on the case in which two people played a repeated prisoners dilemma game. In this paper we show that the basic ...
متن کاملThe Ghost in the Machine: Inferring Machine-Based Strategies from Observed Behavior
We model strategies of experimental subjects playing indefinitely repeated games as finite state machines. In order to do likelihood-based inference on machines using data from experiments, we add a stochastic element to the machines. As in most machine models for repeated game strategies, state transitions are deterministic. Actions, however, are random and their distribution is state dependen...
متن کاملCooperation cannot be sustained in a Discounted Repeated Prisoners’ Dilemma with Patient Short and Long Run Players
This study presents a modified version of the repeated discounted prisoners’ dilemma with long and short-run players. In our setting a short-run player does not observe the history that has occurred before he was born, and survives into next phases of the game with a probability given by the current action profile in the stage game. Thus, even though it is improbable, a short-run player may liv...
متن کامل